Mining Data Quality In Completeness

Authors

  • Shouhong Wang
  • Hai Wang
Abstract

Completeness is an important attribute of data quality. This paper discusses the measures of completeness of data in a data set. It proposes a data mining model based on self-organizing maps (SOM) to visualize the patterns of missing values in a data set to assess the data quality in completeness.
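
As a rough illustration of what a completeness measure and a missing-value pattern can look like, the sketch below computes, for a small made-up table, the fraction of non-missing values per attribute and the binary missingness pattern of each record. These are simple, commonly used choices and are not necessarily the measures or the SOM model proposed in the paper; the function names and the sample data are hypothetical. Binary pattern vectors of this kind are the sort of input one could feed to a self-organizing map.

```python
from collections import Counter


def attribute_completeness(records, attributes):
    """Fraction of records with a non-missing (non-None) value, per attribute."""
    n = len(records)
    if n == 0:
        # No records: report zero completeness rather than dividing by zero.
        return {a: 0.0 for a in attributes}
    return {
        a: sum(1 for r in records if r.get(a) is not None) / n
        for a in attributes
    }


def missing_patterns(records, attributes):
    """Count binary missingness patterns per record: 1 = missing, 0 = present."""
    return Counter(
        tuple(1 if r.get(a) is None else 0 for a in attributes)
        for r in records
    )


# Hypothetical sample data, for illustration only.
ATTRS = ["age", "income", "city"]
DATA = [
    {"age": 34, "income": 52000, "city": "Boston"},
    {"age": None, "income": 48000, "city": "Dartmouth"},
    {"age": 41, "income": None, "city": None},
    {"age": None, "income": 61000, "city": "Halifax"},
]

completeness = attribute_completeness(DATA, ATTRS)
patterns = missing_patterns(DATA, ATTRS)
```

Here `completeness` maps each attribute to a ratio between 0 and 1, and `patterns` counts how often each combination of missing attributes occurs, which gives a first view of whether values are missing together or independently.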


Similar resources

Assessment of the completeness of Volunteered Geographic Information focusing on building blocks data (Case Study: Tehran metropolis)

Open Street Map (OSM) is currently the largest collection of volunteered geographic data, widely used in many projects as an alternative to or integrated with authoritative data. However, the quality of these data has been one of the obstacles to their wide use. In this article, from among the elements related to the quality of volunteered geographic data, we have tried to examine the com...


Comparison of the observance of quality elements in the coding of diseases and procedures in teaching hospitals of the Iran, Tehran, and Shahid Beheshti Universities of Medical Sciences

Introduction: Because of the importance of coded data in quality management activities, case-mix management, planning, marketing, research activities, fee-for-service initiatives, patient safety monitoring, the development of clinical decision support tools, and public health surveillance, observance of coding quality elements is more necessary than ever. Having thorough knowledge of the classific...


On Global Completeness of Event Logs

The field of process mining provides a collection of techniques and tools that aim to support the extraction of information out of event logs. This information may provide businesses insight into actual execution and performance of their business processes and may help identify ways of improving these processes. While the quality of the results of the application of mining algorithms depends on...


Analytical Comparison of Methods for Calculating the Completeness of VGI

Spatial data, one of the main needs of human societies today, from business organizations to general users, cannot meet the needs of a wide range of users without changing the structure of conventional methods of data registration and updating on a metropolitan scale. Open Street Map, as one of the most successful implementations of the crowdsourcing approach to spatial data with th...


The Effects and Interactions of Data Quality and Problem Complexity on Data Mining

Data quality remains a persistent problem in practice and a challenge for research. In this study we focus on four of the most important dimensions of data quality: accuracy, completeness, consistency, and timeliness. Definitions and conceptual models for these dimensions have not been collectively considered with respect to data mining in general and a key determinant of data mining outcomes, p...




Publication date: 2007